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DETAILED ACTION 
Continued Examination Under 37 CFR 1.114 

1 . A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1.17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1.17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
1/19/2007 has been entered. 

Claim Rejections - 35 USC § 101 

2. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

Claims 1-7, 9-20, and 22-26 are rejected under 35 U.S.C. 101 as being directed to non- 
statutory subject matter. The language of the claims raises a question as to whether the claims 
are directed merely to an environment or machine which would result in a practical application 
producing a concrete useful, and tangible result to form the basis of statutory subject matter 
under 35 U.S.C. 101. 

Claims 1-7, 9-20, and 22-26 are rejected because they do not recite a practical 



Application/Control Number: 10/643,628 Page 3 

Art Unit: 2166 

application by producing a physical transformation or producing a useful, concrete, and 
tangible results. To perform a physical transformation, the claimed invention must transform an 
article of physical object into a different state or thing. Transformation of data is not a physical 
transformation. A useful, concrete, and tangible results must be either specifically recited in 
the claim or flow inherently therefrom. To be useful the claimed invention must establish a 
specific, substantial, and credible utility. To be concrete the claimed invention must be able to 
produce reproducible results. To be tangible the claimed invention must produce a practical 
application or real world result. 

To expedite a complete examination of the instant application the claims rejected 
under U.S.C. 101 (nonstatutory) above are further rejected as set forth below in 
anticipation of application amending these claims to place them within the four 
categories of invention. 

Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 
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Claims 1-7, 9-20, and 22-26 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Agrawal et al. (Agrawal hereinafter) (U.S. Patent No. 6,324,533) in 
view of Roberto Javier Bayardo. (Bayardo hereinafter) (U.S Patent No. 6,138,1 17). 

With respect to claim 1, Agrawal teaches a method for performing a frequent 
itemset operation, the method comprising the steps of: 

"within a database server, receiving a database statement that specifies 
frequency criteria and additional criteria and performing said frequent itemset 
operation as part of execution of the database statement to produce results, 
wherein the results include frequent itemsets that satisfy both said frequency 
criteria and said additional criteria, and wherein the results do not include 
frequent itemsets that satisfy said frequency criteria but do not satisfy said 
additional criteria" as the frequent (n+2)-itemsets are determined using cascaded 
subqueries by: a) selecting distinct first items in the candidate itemsets using a 
subquery (Agrawal Col 3, Lines 2-4). Using the results of the last subqueries to 
determine which of the (n+2)-itemsets are frequent. In generating rules from the union 
of the frequent itemsets, all items from the frequent itemsets are first put into a table F. 
A set of candidate rules is created from the table Fusing a table function. These 
candidate rules are joined with the table F, and filtered to remove those that do not meet 
a confidence criteria (Agrawal Col 3, Lines 9-16). 

F consists of k+2 attributes (item. sub. 1, . . . , item. sub. k, support, len), where k is 
the size of the largest frequent itemset and len is the length of the itemset (Agrawal Col 
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8, Lines 4-6). Sequence of operations can be implemented as a single SQL query for 
any k, as shown in FIG. 12. Therefore the query specifies both the frequency criteria 
and the additional criteria k, which is the size of an itemset. 

"wherein said frequency criteria specifies at least one criterion that related 
to how frequently combination of items appear together" as to find all combinations 
of items whose support is greater than minimum support. Call those combinations 
frequency itemsets (Agrawal Col 5, Lines 20-23). 

Agrawal teaches the elements of claim 1 as noted above but does not explicitly 
disclose, "wherein said additional criteria do not specify any criterion that related 
to how frequently combinations of items appear together." 

However, Bayardo discloses, "wherein said additional criteria do not specify 
any criterion that relates to how frequently combinations of items appear 
together" as Max-Miner usually performs less database passes than this bound in 
practice when the longest frequent itemsets are more than 10 in length (Bayardo Col 9, 
Lines 57-60). Examiner interprets the length of 10 as additional criteria. 

It is still another object of the present invention to quickly identify those patterns 
that are both frequent and maximal so that the set of maximal frequent patterns 
represents the set of all frequent patterns (Bayardo Col 3, Lines 32-35 and Lines 40- 
56). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the teaching of the cited references because 
Bayardo's teachings would have allowed Agrawal to provide an efficient method for 
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extracting relatively long frequent patterns from a database of transaction records where 
each record includes several data items. 

With respect to claim 2, Agrawal teaches "the method of claim 1, wherein the 
database statement is expressed in a particular database language, and wherein 
the particular database language is SQL" as a method for mining data relationships 
from the integrated mining system in the form of queries to SQL engines enhanced with 
object-relational extensions (SQL-OR), such as user-defined functions (UDFs) and table 
functions (Agrawal Col 2, Lines 33-36). 

With respect to claim 3, Agrawal teaches "the method of claim 1, wherein the 
frequency criteria and the additional criteria are identified by a construct, and 
wherein the construct is a table function" as a method for mining data relationships 
from the integrated mining system in the form of queries to SQL engines enhanced with 
object-relational extensions (SQL-OR), such as user-defined functions (UDFs) and table 
functions (Agrawal Col 2, Lines 33-36). Examiner interpreted the table functions as 
construct. 

With respect to claim 4, Agrawal teaches the method of claim 1 wherein: 
"the database statement includes a first indication of a first input format" 

as the data table is first transformed into a vertical format by creating for each item a 
BLOB containing all tids that contain that item (Tid-list creation phase) and then count 
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the support of itemsets by merging together these tid-lists (support counting phase) 
(Agrawal Col 12, Lines 43-47). 

"the frequent itemset operation operates on input that conforms to said 
first input format" as a table function Gather is used for creating the Tid-lists. This is 
the same as the Gather function in GatherJoin except here, the tid-list is created for 
each frequent item. The data table T is scanned in the (item, tid) order and passed to 
the function Gather. The function collects the tids of all tuples of T with the same item in 
memory and outputs a (item, tid-list) tuple for items that meet the minimum support 
criterion. The tid-lists are represented as BLOBs and stored in a new TidTable with 
attributes (item, tid-list) (Agrawal Col 12, Lines 48-56). 

"the method further comprises the steps of: parsing a second database 
statement to detect within the second database statement the construct that 
extends a database language" as a method for mining data in an integrated database 
and data-mining system. Start with step 30, a group-by query is performed on the data 
transactions to generate a set of frequent 1 -itemsets. One-itemsets are those having 
exactly one item each, while an itemset is frequent if the number of transactions 
containing it is at least at a specified number. At step 31, frequent 2-itemsets are 
determined from the frequent 1 -itemsets and the transaction table. A candidate set of 
(n+2)-itemsets is next generated in step 32 from the frequent (n+1)-itemsets, where 
n=1 . At step 33, frequent (n+2)-itemsets are generated from the candidate set of (n+2)- 
itemsets and the transaction table using a query (Agrawal Col 6, Lines 43-55). A first 
query is being performed to generate 1 -itemsets, and (n+2) itemsets are being 
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generated using another query, "wherein the second database statement includes a 
second indication of a second input format that is different from said first input 
format" as a horizontal format where each tid is followed by a collection of all its items 
(Agrawal Col 10, Lines 37-38). 

"in response to detection of said construct in said second database 
statement, the database server performing a second frequent itemset operation 
as part of execution of the second database statement" as the mining operation is 
expressed in some extension of SQL or a graphical language, which are input to 
preprocessor 21 . This preprocessor generates appropriate SQL translations for the 
mining operation. For example, these SQL translations may be those that are executed 
by a SQL-92 relational engine 22. It is assumed that blobs, user-defined functions, and 
table functions are available in the object-relational engine. The mining results might be 
output to a depository 24 (Agrawal Col 6, Lines 26-42). "wherein the second frequent 
itemset operation operates on input that conforms to said second format" as K- 
way Join approach where the k-way self join of T is replaced with the table functions 
Gather and Comb-K. It is possible to merge these functions together as a single table 
function GatherComb-K. The Gather function is not required when the data is already in 
a horizontal format where each tid is followed by a collection of all its items (Agrawal 
Col 10, Lines 33-38). 

With respect to claim 5, Agrawal teaches "the method of claim 4 wherein the 
first indication is identification of a first table function" as a table function Gather is 
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used for creating the Tid-lists. This is the same as the Gather function in GatherJoin 
except here, the tid-list is created for each frequent item. The data table T is scanned in 
the (item, tid) order and passed to the function Gather. The function collects the tids of 
all tuples of T with the same item in memory and outputs a (item, tid-list) tuple for items 
that meet the minimum support criterion (Agrawal Col 12, Lines 48-56). "and the 
second indication is identification of a second table function" as the output of 
Gather is passed to another table function Comb-K which returns all k-item 
combinations formed out of the items of a transaction (Agrawal Col 10, Lines 24-27). 

With respect to claim 6, Agrawal teaches "the method of claim 1 wherein the 
frequent itemset operation uses, as input, a row source that is generated during 
execution of other operations specified in said database statement" as output is a 
collection of rules of varying length. The maximum length of these rules is much 
smaller than the number of items and is rarely more than a dozen. Therefore, a rule is 
represented as a tuple in a fixed-width table where the extra column values are set to 
NULL to accommodate rules involving smaller itemsets. The schema of a rule is 
(item. sub. 1, . . . , item. sub. k, len, rulem, confidence, support) where k is the size of the 
largest frequent itemset (Agrawal Col 5, Lines 65-67 & Col 6, Lines 1-6). A table 
function, GenRules, is used to generate all possible rules from a frequent itemset. The 
input to the function is a frequent itemset. For each itemset, it outputs tuples 
corresponding to rules with all non-empty proper subsets of the itemset in the 
consequent. The table function outputs tuples with k+3 attributes, Tjtem.sub.1 , . . . , 
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Tjtem.sub.k, T_support, T_ten, T_rulem (Agrawal Col 8, Lines 7-13). From first 
operation a row/tuple is being obtained, which is then being used as an input 

With respect to claim 7, Agrawal teaches "the method of claim 1 wherein the 
frequent itemset operation produces, as output, a row source that is used as 
input for other operations specified in said database statement" as output is a 
collection of rules of varying length. The maximum length of these rules is much 
smaller than the number of items and is rarely more than a dozen. Therefore, a rule is 
represented as a tuple in a fixed-width table where the extra column values are set to 
NULL to accommodate rules involving smaller itemsets. The schema of a rule is 
(item. sub. 1, . . . , item. sub. k, len, rulem, confidence, support) where k is the size of the 
largest frequent itemset (Agrawal Col 5, Lines 65-67 & Col 6, Lines 1-6). A table 
function, GenRules, is used to generate all possible rules from a frequent itemset. The 
input to the function is a frequent itemset. For each itemset, it outputs tuples 
corresponding to rules with all non-empty proper subsets of the itemset in the 
consequent. The table function outputs tuples with k+3 attributes, Tjtem.sub.1 , . . . , 
Tjtem.sub.k, T_support, T_ten, T__rulem (Agrawal Col 8, Lines 7-13). From first 
operation a row/tuple is being obtained as an output, which is then being used as an 
input. 

With respect to claim 9, Agrawal teaches "the method of claim 1 wherein: the 
additional criteria specify a minimum length; and the step of performing the 
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frequent itemset operation includes performing a frequent itemset operation 
whose results exclude all item sets that include fewer items than the minimum 
length specified by the additional criteria" as combinations of items whose support 
is greater than minimum support. Call those combinations frequent itemsets (Agrawal 
Col 5, Lines 21-23). The function collects the tids of all tuples of T with the same item in 
memory and outputs a (item, tid-list) tuple for items that meet the minimum support 
criterion (Agrawal Col 12, Lines 52-55). 

Agrawal further teaches the function collects the tids of all tuples of T with the . 
same item in memory and outputs a (item, tid-list) tuple for items that meet the minimum 
support criterion. The tid-lists are represented as BLOBs and stored in a new TidTable 
with attributes (item, tid-list) (Agrawal Col 11, Lines 49-56). 

Agrawal teaches the elements of claim 9 as noted above but does not explicitly 
teaches "a minimum length." 

However, Bayardo teaches "a minimum length" as Max-Miner usually 
performs less database passes than this bound in practice when the longest .frequent 
itemsets are more than 10 in length (Bayardo Col 9, Lines 57-60). Examiner interprets 
the length of 10 as the minimum length. 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the teaching of the cited references because 
Bayardo's teachings would have allowed Agrawal to provide an efficient method for 
extracting relatively long frequent patterns from a database of transaction records where 
each record includes several data items. 
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With respect to claim 10, Agrawal teaches "the method of claim 1 wherein: 
the additional criteria specify a maximum length; and the step of performing the 
frequent itemset operation includes performing a frequent itemset operation 
whose results exclude all item sets that include more items than the maximum 
length specified by the additional criteria" as F consists of k+2 attributes (item. sub. 1 , 
. . . , item. sub. k, support, len), where k is the size of the largest frequent itemset and 
len is the length of the itemset (Agrawal Col 8, Lines 4-6), 

Agrawal further teaches in particular, it is not practical to assume that all items in 
a transaction appear as different columns of a single tuple because often the number of 
items per transaction can be more than the maximum number of columns that the 
database supports. For instance, for one of our real-life datasets the maximum number 
of items per transaction is 872 and for another it is 700 (Agrawal Col 5, Lines 56-60). 

Agrawal teaches the elements of claim 10 as noted above but does not explicitly 
teaches "a maximum length." 

However, Bayardo discloses "a maximum length" as the most part, frequent- 
pattern mining methods have been developed to operate on databases in which the 
longest frequent patterns are relatively short, e.g., those with less than 10 items 
(Bayardo Col 1 , Lines 22-26). Examiner interprets the length of 10 as the maximum 
length. 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the teaching of the cited references because 
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Bayardo's teachings would have allowed Agrawal to provide an efficient method for 
extracting relatively long frequent patterns from a database of transaction records where 
each record includes several data items. 

With respect to claim 1 1 , Agrawal teaches "the method of claim 1 wherein: 
the additional criteria specify a set of one or more included items; and the step of 
performing the frequent itemset operation includes performing a frequent itemset 
operation whose results exclude all itemsets that do not include all items in said 
set of one or more included items" as the frequent (n+2)-itemsets are determined 
using cascaded subqueries by: a) selecting distinct first items in the candidate itemsets 
using a subquery. In generating rules from the union of the frequent itemsets, all items 
from the frequent itemsets are first put into a table F. These candidate rules are joined 
with the table F, and filtered to remove those that do not meet a confidence criteria 
(Agrawal Col 3, Lines 2-16). 

Agrawal teaches the elements of claim 1 1 as noted above but does not explicitly 
teaches "one or more included items." 

However, Bayardo discloses "one or more included items" as a method for 
identifying patterns from a database of records including the steps of: (1) generating an 
initial set C of candidates where each candidate c includes two distinct sets of items: 
c.head and c.tail (Bayardo Col 3, Lines 42-45). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the teaching of the cited references because 



Application/Control Number: 10/643,628 Page 14 

Art Unit: 2166 

Bayardo's teachings would have allowed Agrawal to provide an efficient method for 
extracting relatively long frequent patterns from a database of transaction records where 
each record includes several data items. 

With respect to claim 12, Agrawa! teaches "the method of claim 1 wherein the 
step of performing the frequent itemset operation includes performing a frequent 
itemset operation whose results identify frequent itemsets, and for each of the 
frequent itemsets, a count of how many item groups included the frequent 
itemset" as a set of frequent 1 -itemsets is generated using a group-by query on data 
transactions. From these frequent 1 -itemsets and the transactions, frequent 2-itemsets 
are determined. A candidate set of (n+2)-itemsets are generated from the frequent 2- 
itemsets, where n=1. Frequent (n+2)-itemsets are determined from candidate set and 
the transaction table using a query operation (Agrawal Abstract). 

With respect to claim 13, Agrawal teaches "the method of claim 1 wherein the 
step of performing the frequent itemset operation includes performing a frequent 
itemset operation whose results identify frequent itemsets, and for each of the 
frequent itemsets, a count of how items are in the frequent itemset" as a set of 

frequent 1 -itemsets is generated using a group-by query on data transactions (Agrawal 
Abstract). The support counting phase, conceptually for each itemset in C.sub.k the tid- 
lists of all k items are collected and the number of tids in the intersection of these k lists 
is counted using a user defined function (UDF) (Agrawal Col 12, Lines 56-59). 
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Group of claims 14-20 and 22-26 is essentially the same as group of claims 1-7, 
9-20 except they set forth the claimed invention as a computer readable media carrying 
instructions, and are rejected for the same reasons as applied hereinabove. 

Response to Arguments 

4. Applicant's arguments have been considered but are moot in view of the new 
ground(s) of rejection. 

See above rejections for response to the arguments. 

Conclusion 

5. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure is listed on 892 form. 

Examiner's Note: Examiner has cited particular figures, columns and line numbers in 
the references as applied to the claims above for the convenience of the applicant. Although 
the specified citations are representative of the teachings in the art and are applied to the 
specific limitations within the individual claim, other passages and figures may apply as well. It 
is respectfully requested from the applicant, in preparing the responses, to fully consider the 
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references in entirety as potentially teaching all or part of the claimed invention, as well as the 
context of the passage as taught by the prior art or disclosed by the examiner. 

Contact Information 

6. Any inquiry concerning this communication or earlier communications from the 

i 

examiner should be directed to Usmaan Saeed whose telephone number is (571)272-4046. 
The examiner can normally be reached on M-F 8-5. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Hosain Alam can be reached on (571)272-3978. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private 
PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 
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